Coping with disfluencies in spontaneous speech recognition

نویسندگان

Frederik Stouten

Jean-Pierre Martens

چکیده

Nowadays, automatic speech recognizers have become quite good in recognizing well prepared fluent speech (e.g. news readings). However, the recognition of spontaneous speech is still problematic. Some important reasons for this are that spontaneous speech is usually less articulated and contains a lot of disfluencies. In this paper, a new methodology for coping with disfluencies is presented and evaluated. The basic idea is to detect disfluencies and to determine the nature of these disfluencies prior to the recognition, and to use that information to control/modify the search. At present, the methodology has been elaborated for filled pauses (FP) and word repetitions (WR). It enables us to eliminate about one associated normal word error per disfluency without introducing a significant augmentation of the computational load.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Coping with disfluencies in spontaneous speech recognition: Acoustic detection and linguistic context manipulation

Nowadays read speech recognition already works pretty well, but the recognition of spontaneous speech is much more problematic. There are plenty of reasons for this, and we hypothesize that one of them is the regular occurrence of disfluencies in spontaneous speech. Disfluencies disrupt the normal course of the sentence and when for instance word interruptions are concerned, they also give rise...

متن کامل

Benefits of Disfluency Detection in Spontaneous Speech Recognition

متن کامل

Handling Disfluencies in Spontaneous Language Models

In automatic speech recognition, a stochastic language model (LM) predicts the probability of the next word on the basis of previously recognized words. For the recognition of dictated speech this method works reasonably well since sentences are typically well-formed and reliable estimation of the probabilities is possible on the basis of large amounts of written text material. However, for spo...

متن کامل

Automatic Detection and Removal of Disfluencies from Spontaneous Speech

Unlike rehearsed and prepared speech, spontaneous speech contains high occurrence of disfluencies, like repetitions, filled pauses, and hesitations. Disfluencies can seriously hamper the word recognition accuracy of an Automatic Speech Recogniser (ASR), by increasing word insertion and deletion and rejection rates. In this paper we introduce signal processing algorithms to automatically identif...

متن کامل

Evaluation of sublexical and lexical models of acoustic disfluencies for spontaneous speech recognition in Spanish

Spontaneous speech is full of acoustic disfluencies that rarely appear in read or laboratory speech. A very simple and straightforward approach is presented, in which acoustic disfluences are modelled by augmenting the inventory of sublexical units, which originally consisted of 23 context independent phones plus a special unit for silent pauses. This set was augmented with 12 additional units ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2004

Coping with disfluencies in spontaneous speech recognition

نویسندگان

چکیده

منابع مشابه

Coping with disfluencies in spontaneous speech recognition: Acoustic detection and linguistic context manipulation

Benefits of Disfluency Detection in Spontaneous Speech Recognition

Handling Disfluencies in Spontaneous Language Models

Automatic Detection and Removal of Disfluencies from Spontaneous Speech

Evaluation of sublexical and lexical models of acoustic disfluencies for spontaneous speech recognition in Spanish

عنوان ژورنال:

اشتراک گذاری